Predicting function from sequence in venom peptide families
نویسندگان
چکیده
Toxins from animal venoms are small peptides that recognize specific molecular targets in the brains of prey or predators. Next generation sequencing has uncovered thousands of diverse toxin sequences, but the functions of these peptides are poorly understood. Here we demonstrate that the use of machine learning techniques on sequence-derived features enables high accuracy in the task of predicting a toxin’s functionality using only its amino acid sequence. Comparison of the performance of several learning algorithms in this prediction task demonstrates that both physiochemical properties of the amino acid residues in a sequence as well as noncontiguous sequence motifs can be used independently to model the sequence dependence of venom function. We rationalize the observed model performance using unsupervised learning and make broad predictions about the distribution of toxin functions in the venome. Keywords—Bioinformatics, machine learning, protein function prediction, venomics.
منابع مشابه
First transcriptome analysis of Iranian scorpion, Mesobuthus eupeus venom gland
Scorpions are generally an important source of bioactive components, including toxins and other small peptides as attractive molecules for new drug development. Mesobuthus eupeus, from medically important and widely distributed Buthidae family, is the most abundant species in Iran. Researchers are interesting on the gland of this scorpion due to the complexity of its venom. Here, we have analyz...
متن کاملFirst transcriptome analysis of Iranian scorpion, Mesobuthus eupeus venom gland
Scorpions are generally an important source of bioactive components, including toxins and other small peptides as attractive molecules for new drug development. Mesobuthus eupeus, from medically important and widely distributed Buthidae family, is the most abundant species in Iran. Researchers are interesting on the gland of this scorpion due to the complexity of its venom. Here, we have analyz...
متن کاملPartial Purification and Characterization of Anticoagulant Factor from the Snake (Echis carinatus) Venom
Objective(s): Snake venoms contain complex mixture of proteins with biological activities. Some of these proteins affect blood coagulation and platelet function in different ways. Snake venom toxin may serve as a starting material for drug design to combat several pathophysiological problems such as cardiovascular disorders. In the present study, purification of anticoagulation facto...
متن کاملCharacterization of cDNA sequence encoding for a novel sodium channel -toxin from the Iranian scorpion Mesobuthus eupeus venom glands
The venoms of Buthidae scorpions are known to contain basic, single-chain protein -toxins consisting of 60-70 amino acid residues that are tightly cross-linked by four disulfide bridges. Total RNA was extracted from the venom glands of scorpion Mesobuthus eupeus collected from the Khuzestan province of Iran and then cDNA was synthesized with the modified oligo (dT) primer and extracted total R...
متن کاملMolecular Characterization of a Three-disulfide Bridges Beta-like Neurotoxin from Androctonus crassicauda Scorpion Venom
Scorpion venom is the richest source of peptide toxins with high levels of specific interactions with different ion-channel membrane proteins. The present study involved the amplification and sequencing of a 310-bp cDNA fragment encoding a beta-like neurotoxin active on sodium ion-channel from the venom glands of scorpion Androctonus crassicauda belonging to the Buthidae family using r...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014